智能论文笔记

Robust Split Federated Learning for U-shaped Medical Image Networks

Ziyuan Yang , Yingyu Chen , Huijie Huangfu , Maosong Ran , Hui Wang , Xiaoxiao Li , Yi Zhang

分类：计算机视觉

2022-12-13

U-shaped networks are widely used in various medical image tasks, such as segmentation, restoration and reconstruction, but most of them usually rely on centralized learning and thus ignore privacy issues. To address the privacy concerns, federated learning (FL) and split learning (SL) have attracted increasing attention. However, it is hard for both FL and SL to balance the local computational cost, model privacy and parallel training simultaneously. To achieve this goal, in this paper, we propose Robust Split Federated Learning (RoS-FL) for U-shaped medical image networks, which is a novel hybrid learning paradigm of FL and SL. Previous works cannot preserve the data privacy, including the input, model parameters, label and output simultaneously. To effectively deal with all of them, we design a novel splitting method for U-shaped medical image networks, which splits the network into three parts hosted by different parties. Besides, the distributed learning methods usually suffer from a drift between local and global models caused by data heterogeneity. Based on this consideration, we propose a dynamic weight correction strategy (\textbf{DWCS}) to stabilize the training process and avoid model drift. Specifically, a weight correction loss is designed to quantify the drift between the models from two adjacent communication rounds. By minimizing this loss, a correction model is obtained. Then we treat the weighted sum of correction model and final round models as the result. The effectiveness of the proposed RoS-FL is supported by extensive experimental results on different tasks. Related codes will be released at https://github.com/Zi-YuanYang/RoS-FL.

translated by 谷歌翻译

Multi-view deep learning based molecule design and structural optimization accelerates the SARS-CoV-2 inhibitor discovery

Chao Pang , Yu Wang , Yi Jiang , Ruheng Wang , Ran Su , Leyi Wei

分类：机器学习

2022-12-03

In this work, we propose MEDICO, a Multi-viEw Deep generative model for molecule generation, structural optimization, and the SARS-CoV-2 Inhibitor disCOvery. To the best of our knowledge, MEDICO is the first-of-this-kind graph generative model that can generate molecular graphs similar to the structure of targeted molecules, with a multi-view representation learning framework to sufficiently and adaptively learn comprehensive structural semantics from targeted molecular topology and geometry. We show that our MEDICO significantly outperforms the state-of-the-art methods in generating valid, unique, and novel molecules under benchmarking comparisons. In particular, we showcase the multi-view deep learning model enables us to generate not only the molecules structurally similar to the targeted molecules but also the molecules with desired chemical properties, demonstrating the strong capability of our model in exploring the chemical space deeply. Moreover, case study results on targeted molecule generation for the SARS-CoV-2 main protease (Mpro) show that by integrating molecule docking into our model as chemical priori, we successfully generate new small molecules with desired drug-like properties for the Mpro, potentially accelerating the de novo design of Covid-19 drugs. Further, we apply MEDICO to the structural optimization of three well-known Mpro inhibitors (N3, 11a, and GC376) and achieve ~88% improvement in their binding affinity to Mpro, demonstrating the application value of our model for the development of therapeutics for SARS-CoV-2 infection.

translated by 谷歌翻译

Learned Smartphone ISP on Mobile GPUs with Deep Learning, Mobile AI & AIM 2022 Challenge: Report

Andrey Ignatov , Radu Timofte , Shuai Liu , Chaoyu Feng , Furui Bai , Xiaotao Wang , Lei Lei , Ziyao Yi , Yan Xiang , Zibin Liu

分类：计算机视觉

2022-11-07

The role of mobile cameras increased dramatically over the past few years, leading to more and more research in automatic image quality enhancement and RAW photo processing. In this Mobile AI challenge, the target was to develop an efficient end-to-end AI-based image signal processing (ISP) pipeline replacing the standard mobile ISPs that can run on modern smartphone GPUs using TensorFlow Lite. The participants were provided with a large-scale Fujifilm UltraISP dataset consisting of thousands of paired photos captured with a normal mobile camera sensor and a professional 102MP medium-format FujiFilm GFX100 camera. The runtime of the resulting models was evaluated on the Snapdragon's 8 Gen 1 GPU that provides excellent acceleration results for the majority of common deep learning ops. The proposed solutions are compatible with all recent mobile GPUs, being able to process Full HD photos in less than 20-50 milliseconds while achieving high fidelity results. A detailed description of all models developed in this challenge is provided in this paper.

translated by 谷歌翻译

PulseDL-II: A System-on-Chip Neural Network Accelerator for Timing and Energy Extraction of Nuclear Detector Signals

Pengcheng Ai , Zhi Deng , Yi Wang , Hui Gong , Xinchi Ran , Zijian Lang

分类：机器学习

2022-09-02

配备高速数字化器的前端电子设备正在使用并建议将来的核检测器。最近的文献表明，在处理来自核检测器的数字信号时，深度学习模型，尤其是一维卷积神经网络。模拟和实验证明了该领域神经网络的令人满意的准确性和其他好处。但是，仍需要研究特定的硬件加速在线操作。在这项工作中，我们介绍了Pulsedl-II，这是一种专门设计的，专门为事件功能（时间，能量等）从具有深度学习的脉冲中提取的应用。根据先前的版本，PULSEDL-II将RISC CPU纳入系统结构，以更好地功能灵活性和完整性。 SOC中的神经网络加速器采用三级（算术单元，处理元件，神经网络）层次结构，并促进数字设计的参数优化。此外，我们设计了一种量化方案和相关的实现方法（恢复和位移位），以在所选层类型的选定子集中与深度学习框架（例如Tensorflow）完全兼容。通过当前方案，支持神经网络的量化训练，并通过专用脚本自动将网络模型转换为RISC CPU软件，几乎没有准确性损失。我们在现场可编程门阵列（FPGA）上验证pulsedl-ii。最后，通过由直接数字合成（DDS）信号发生器和带有模数转换器（ADC）的FPGA开发板组成的实验设置进行系统验证。拟议的系统实现了60 PS的时间分辨率和0.40％的能量分辨率，在线神经网络推断在信号与噪声比（SNR）为47.4 dB时。

translated by 谷歌翻译

HTML版本

Generative Domain Adaptation for Face Anti-Spoofing

Qianyu Zhou , Ke-Yue Zhang , Taiping Yao , Ran Yi , Kekai Sheng , Shouhong Ding , Lizhuang Ma

分类：计算机视觉

2022-07-20

基于无监督的域适应性（UDA），由于目标情景的表现有希望的表现，面部抗散热器（FAS）方法引起了人们的注意。大多数现有的UDA FAS方法通常通过对齐语义高级功能的分布来拟合受过训练的模型。但是，对未标记的目标域的监督不足，低水平特征对齐降低了现有方法的性能。为了解决这些问题，我们提出了UDA FAS的新颖观点，该视角将目标数据直接适合于模型，即，通过图像翻译将目标数据风格化为源域样式，并进一步将风格化的数据提供给训练有素的数据分类的源模型。提出的生成域适应（GDA）框架结合了两个精心设计的一致性约束：1）域间神经统计量的一致性指导发生器缩小域间间隙。 2）双层语义一致性确保了风格化图像的语义质量。此外，我们提出了域内频谱混合物，以进一步扩大目标数据分布，以确保概括并减少域内间隙。广泛的实验和可视化证明了我们方法对最新方法的有效性。

translated by 谷歌翻译

Adaptive Mixture of Experts Learning for Generalizable Face Anti-Spoofing

Qianyu Zhou , Ke-Yue Zhang , Taiping Yao , Ran Yi , Shouhong Ding , Lizhuang Ma

分类：计算机视觉

2022-07-20

随着各种面部表现攻击不断出现，基于域概括（DG）的面部抗散热（FAS）方法引起了人们的注意。现有的基于DG的FAS方法始终捕获用于概括各种看不见域的域不变功能。但是，他们忽略了单个源域的歧视性特征和不同域的不同域特异性信息，并且训练有素的模型不足以适应各种看不见的域。为了解决这个问题，我们提出了专家学习（AMEL）框架的自适应混合物，该框架利用了特定于域的信息以适应性地在可见的源域和看不见的目标域之间建立链接，以进一步改善概括。具体而言，特定领域的专家（DSE）旨在研究歧视性和独特的域特异性特征，以作为对共同域不变特征的补充。此外，提出了动态专家聚合（DEA），以根据与看不见的目标域相关的域相关的每个源专家的互补信息来自适应地汇总信息。并结合元学习，这些模块合作，可适应各种看不见的目标域的有意义的特定于域特异性信息。广泛的实验和可视化证明了我们对最先进竞争者的方法的有效性。

translated by 谷歌翻译

Assessing the Effects of Hyperparameters on Knowledge Graph Embedding Quality

Oliver Lloyd , Yi Liu , Tom Gaunt

分类：人工智能 | 自然语言处理

2022-07-01

将知识图嵌入到低维空间中是将方法（例如链接预测或节点分类）应用于这些数据库的流行方法。就计算时间和空间而言，这种嵌入过程非常昂贵。其部分原因是对超参数的优化，涉及从大型超参数空间中反复，引导或蛮力选择反复采样，并测试所得嵌入的质量。但是，并非该搜索空间中的所有超参数都同样重要。实际上，在先验了解超参数的相对重要性的情况下，可以完全从搜索中消除一些，而不会显着影响输出嵌入的整体质量。为此，我们进行了SOBOL灵敏度分析，以评估调整不同超参数对嵌入质量方差的影响。这是通过进行数千个嵌入试验来实现的，每次测量不同的超参数构型产生的嵌入质量。我们使用此模型为每个高参数生成SOBOL灵敏度指数，对这些超参数配置的嵌入质量进行了回归。通过评估SOBOL指数之间的相关性，我们发现具有不同数据集特征的知识图之间的超参数敏感性的显着差异是这些不一致的可能原因。作为这项工作的另一个贡献，我们确定了UMLS知识图中的几个关系，这些关系可能会通过逆关系导致数据泄漏，并得出并存在该图的泄漏射击变体的UMLS-43。

translated by 谷歌翻译

CFNet: Learning Correlation Functions for One-Stage Panoptic Segmentation

Yifeng Chen , Wenqing Chu , Fangfang Wang , Ying Tai , Ran Yi , Zhenye Gan , Liang Yao , Chengjie Wang , Xi Li

分类：计算机视觉

2022-01-13

最近，在一步的Panoptic细分方法上越来越关注，旨在有效地旨在在完全卷积的管道内共同分割实例和材料。但是，大多数现有的工作直接向骨干功能提供给各种分段头，忽略语义和实例分割的需求不同：前者需要语义级别的判别功能，而后者需要跨实例可区分的功能。为了缓解这一点，我们建议首先预测用于增强骨干特征的不同位置之间的语义级和实例级相关性，然后分别将改进的鉴别特征馈送到相应的分割头中。具体地，我们将给定位置与所有位置之间的相关性组织为连续序列，并将其预测为整体。考虑到这种序列可以非常复杂，我们采用离散的傅里叶变换（DFT），一种可以近似由幅度和短语参数化的任意序列的工具。对于不同的任务，我们以完全卷积的方式从骨干网上生成这些参数，该参数通过相应的任务隐含地优化。结果，这些准确和一致的相关性有助于产生符合复杂的Panoptic细分任务的要求的合理辨别特征。为了验证我们的方法的有效性，我们对几个具有挑战性的Panoptic细分数据集进行实验，并以45.1美元\％PQ和ADE20K为32.6美元\％PQ实现最先进的绩效。

translated by 谷歌翻译

Exploiting Fine-grained Face Forgery Clues via Progressive Enhancement Learning

Qiqi Gu , Shen Chen , Taiping Yao , Yang Chen , Shouhong Ding , Ran Yi

分类：计算机视觉

2021-12-28

随着面部伪造技术的快速发展，由于安全问题，伪造的检测引起了越来越多的关注。现有方法尝试使用频率信息在高质量的锻造面上进行微妙的伪影。然而，频率信息的开发是粗糙的，更重要的是，他们的香草学习过程努力提取细粒度的伪造痕迹。为了解决这个问题，我们提出了一个渐进式增强学习框架来利用RGB和细粒度的频率线索。具体而言，我们对RGB图像进行细粒度分解，以在频率空间中完全删除真实的迹线和虚假的迹线。随后，我们提出了一种基于双分支网络的渐进式增强学习框架，结合自增强和互增强模块。自增强模块基于空间噪声增强和渠道注意，捕获不同输入空间中的迹线。通过在共享空间维度中通信，互增强模块同时增强RGB和频率特征。逐步增强过程有助于学习具有细粒面的伪造线索的歧视特征。在多个数据集上进行广泛的实验表明我们的方法优于最先进的面部伪造检测方法。

translated by 谷歌翻译

Domain Adaptive Semantic Segmentation via Regional Contrastive Consistency Regularization

Qianyu Zhou , Chuyun Zhuang , Ran Yi , Xuequan Lu , Lizhuang Ma

分类：计算机视觉

2021-10-11

近年来，对语义分割的无监督域适应性（UDA）进行了充分研究。但是，大多数现有的作品在很大程度上忽略了不同领域的本地区域一致性，并且对室外环境的变化的鲁棒性较低。在本文中，我们提出了一种新颖且完全端到端的可训练方法，称为域自适应语义分割的区域对比度一致性（RCCR）。我们的核心思想是从不同图像的相同位置提取的相似区域特征，即原始图像和增强图像，以更加接近，同时将两个图像的不同位置的特征推到要分开的不同位置。我们通过两种抽样策略提出了一个区域对比度损失，以实现有效的区域一致性。此外，我们呈现动力投影头，其中教师投射头是学生的指数移动平均值。最后，内存库机制旨在在不同的环境下学习更健壮和稳定的区域特征。对两个常见的UDA基准测试的广泛实验，即GTAV到CityScapes和CityScapes的合成，这表明我们的方法表现优于最先进的方法。

translated by 谷歌翻译